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The extraordinary potential of metagenomic functional analyses to identify activities of interest present in 
uncultured microorganisms has been limited by reduced gene expression in surrogate hosts. We have 
developed vectors and specialized E. coli strains as improved metagenomic DNA heterologous expression 
systems, taking advantage of viral components that prevent transcription termination at metagenomic 
terminators. One of the systems uses the phage T7 RNA-polymerase to drive metagenomic gene expression, 
while the other approach uses the lambda phage transcription anti-termination protein N to limit 
transcription termination. A metagenomic library was constructed and functionally screened to identify 
genes conferring carbenicillin resistance to E. coli. The use of these enhanced expression systems resulted in 
a 6-fold increase in the frequency of carbenicillin resistant clones. Subcloning and sequence analysis showed 
that, besides P-lactamases, efflux pumps are not only able contribute to carbenicillin resistance but may in 
fact be sufficient by themselves to convey carbenicillin resistance. 

Metagenomic libraries are gene libraries constructed from total DNA directly isolated from an envir- 
onmental source rather than laboratory cultures. The key advantage of these libraries is that they allow 
the access to unknown environmental genetic resources independently of our ability to cultivate the 
microorganisms encoding them. Metagenomic libraries can be analyzed by systematically sequencing all the 
genomic clones obtained, or by functionally screening clones for novel phenotypes conveyed to host bacteria from 
metagenomic sequences. 

Sequence-based gene identification relies on similarities to known gene sequences, while functional screening 
approaches have the advantage of being able to identify genes whose functions cannot be predicted by sequence 
analysis alone. However, a significant limitation of functional screening is that detection depends on efficient 
expression of the cloned genes. In fact, it has been shown that most genes are not usually expressed in the selected 
host bacterium 1,2 . Although some novel activities have been detected using E. coli as the surrogate host, increasing 
the efficiency of metagenomic gene expression in the bacterial hosts could greatly improve our ability to detect 
metagenomic clones encoding genes with novel functions. 

Once environmental DNA samples (metagenomic DNA) are cloned into multicopy expression vectors, 
their expression can be driven in heterologus systems using promoters adjacent to the cloning site. However, 
the ability to efficiently drive metagenomic gene expression inversely correlates with the size of the cloned 
DNA. A major limiting factor is the presence of transcription terminators upstream the gene of interest. High 
hit rates have been reported for metagenomic libraries with heterologous promoters and very short DNA 
fragments of 1-3 Kb long 3 5 . However, reducing the size of the cloned DNA fragments implies a lower 
probability of having a gene of interest in a given clone and, therefore, a higher number of metagenomic 
clones are required to cover the same length of total metagenomic DNA. The use of small clones may be 
successful in identifying activities that are selectable and depend on expression of a single gene but is unlikely 
to be suitable for non-selectable activities given the large number of metagenomic clones that would need to 
be screened. 

In order to overcome the inherent limitation of expressing genes in E. coli, the most frequent surrogate host for 
metagenomic libraries, a number of shuttle or broad host range vectors have been constructed with differing 
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degrees of success in allowing the transfer and screening of the meta- 
genomic library to different host bacteria 613 . Functional screening of 
metagenomic libraries using these vectors have shown that different 
positive clones can be obtained depending on the host bacteria used 
for the screening 14 . Although many of these methods represent an 
improvement, the expression of metagenomic genes still relies on 
their own expression capacity within the surrogate host. 

In this work, we used an alternative approach to increase the 
potential of functional metagenomic analysis. We report on the con- 
struction of specialized vectors and strains, which combine the use of 
E. coli as the surrogate host with two modified heterologous express- 
ion systems that incorporate viral components. One is based on the 
phage T7 RNA-polymerase that is insensitive to many of the bacterial 
transcription termination signals 1516 . The other expression method 
involves the use of the N anti- termination protein from the X phage 17 
coupled to a positively controlled bacterial regulatory system indu- 
cible by salicylate 18 . 

Results 

Construction of metagenomic vectors. In order to improve perfor- 
mance of the pCClFOS cloning vector, we modified it in three 
different ways: 1) insertion of an oriT to allow transfer of the meta- 
genomic library by conjugation, 2) the addition of a transcription 
system subject to anti-termination to improve metagenomic gene 
expression, and 3) incorporation of a promoterless gfp gene to 
allow detection of metagenomic gene expression. 

We started from a previous modification of the pCClFOS fosmid 
vector (Epicentre), consisting of the insertion of two Ceul sites flank- 
ing the metagenomic DNA cloning site {Ecd72l) (M. Ferrer, personal 
communication) . 

The oriT from plasmid RP4 was cloned into the unique Hpal site 
of pCClFOS-CeuI, thus generating pMP0561, to allow transfer by 
conjugation in triparental matings 19 . Next, a DNA fragment contain- 
ing the psal promoter regulatory sequence, the nah GHILNJK operon 
promoter and the lambda phage nut L site (N-utilization leftward site) 
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Figure 1 | Schematic diagram of the fosmids derived from pCClFOS- 
Ceul. Modifications resulting from this work, including oriT, psal, the 
nut L site and the promoterless gfp shown in the amplified region. 



was synthesized and cloned just upstream of the T7 gene 10 already 
present in pCCl 1FOS, to yield pMP0571 (Fig. 1). The nahGHILNJK 
operon is involved in naphthalene biodegradation and is activated by 
NahR in response to salicylate 18,20,21 . The nut L site (N-utilization 
leftward site) allows the phage transcription anti-termination pro- 
tein N to assemble with the transcription complex, thus making it 
insensitive to transcription termination signals 22,23 . This vector 
should allow transcription of cloned metagenomic DNA by two 
distinct RNA polymerases: (i) the T7 RNA polymerase, from the 
T7 gene 10 promoter, and (ii) the bacterial RNA polymerase modi- 
fied for processive anti-termination by the lambda phage N protein, 
from the psal promoter. 

An additional modification of pMP0571 to yield pMP0579 
involved cloning a promoterless gfp gene with a strong ribosome 
binding site from the T7 gene 10, downstream of the vector promo- 
ters and the metagenomic DNA cloning site (Fig. 1). Metagenomic 
DNA transcription in the appropriate direction can result in tran- 
scription continuing through to this promoterless gfp gene. This 
allows for the exploitation of Substrate Induced Gene Expression 
(SIGEX) technology 24 to identify regulatory systems, by detecting 
GFP fluorescence as an indication of active transcription of the fos- 
mid clones and sorting the appropriate clones by fluorescence 
assisted cell sorting (FACS). For pMP0571 and pMP0579 construc- 
tion details, see supplementary Methods. 

Construction of specialized strains for metagenomic DNA tran- 
scription. The MP0553 strain is a EPI300-T1 derivative, with the 
lacUVS promoter, the attenuator nasF and the gene-1 of the T7 
phage, coding for the T7 RNA polymerase, inserted into its trg lo- 
cus (Fig. 2. For MP0553 construction details, see supplementary 
Methods). In the absence of the lad repressor, such as in EPI300- 
Tl, the ZacUV5 promoter constitutively drives gene-1 transcription. 
The nasF attenuator is a transcription terminator signal within the 
nitrate assimilatory nasFEDCBA operon, subjected to transcriptional 
attenuation 25 , which has been used very successfully to reduce ex- 
pression vector transcription levels 26,27 . Therefore, this strain should 
produce constitutive but low levels of T7 RNA polymerase. 

The MP0554 strain has an insertion in trg containing the activator 
nahR gene and the psal promoter followed by the lambda gene N 
(Fig. 2. For MP0554 construction details, see supplementary 
Methods). Upon induction with salicylate, NahR would activate psal 
in the fosmid vector, thus activating transcription of the metage- 
nomic DNA, and simultaneously psal in the bacterially-inserted 
construction, thus activating N protein-dependent processive anti- 
termination e at transcripts initiated from psal in the vector. MPO- 
555 is a strain essentially the same as MP0554 but with a frameshift 
in the gene N. 

Functionality tests of the vectors and specialized strains. Tri- 
parental matings were prepared in which EPI300-T1 carrying 
different plasmids were used as donors, rifampicin resistant (Rif R ) 
or nalidixic acid resistant (Nal R ) derivatives of EPI300-T1 were used 
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Figure 2 | Schematic diagram of the specialized strains derived from 
E. coli EPI300™-T1, showing the DNA fragment that has been integrated 
into their trg locus. 
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as recipients, and DH5tx bearing pRK2013 used as the helper strain. 
pCClFOS-CeuI could not be transferred by conjugation (mating 
frequency <10~ 7 ). However, pMP0579 was transferred to all the 
recipient strains with frequencies above 10%, representing similar 
or higher frequencies to those obtained with the well-known 
mobilizable plasmid pBBRlMCS-3 (not shown). 

Transcription levels from the heterologous promoters present in 
the modified vectors were tested in pMP0579, which contains the 
promoterless gfp reporter gene downstream of the metagenomic 
cloning site. To test whether transcription initiated at the heterolog- 
ous promoters could proceed through transcription terminators, a 
2.5 Kb DNA fragment bearing the thnL transcription terminator 
from Sphingomonas macrogolitabida strain TFA 28 was cloned into 
the Eco 721 site of pMP0579 to generate pMPO580. 

Fluorescence of the EPI300-Tl/pMPO579 strain was low (about 
1,300 F.U., similar to the intrinsic fluorescence of EPI300-T1 R ). The 
addition of arabinose and salicylate resulted in a small increase in 
GFP expression (Fig. 3). In the specialized strain that produces the 
transcriptional activator NahRbut not the antitermination protein N 
(MP0555), the basal level of gfp expression from pMP0579 was also 
low. Increasing the plasmid copy number by adding arabinose 
resulted in a modest increase in expression levels, but much higher 
levels of expression were obtained when both arabinose and salicyl- 
ate were added to the culture medium together (Fig. 3). This shows 
that psal in pMP0579 is functional and that MP0555 produces the 
required transcriptional activator NahR. In contrast, when using 
pMPO580, which bears the thnL terminator, induction in the pres- 
ence of arabinose and salicylate, was almost negligible, demonstrat- 
ing that the transcription terminator was functional. Interestingly, 
gfp transcription from pMPO580 in MP0554, which expresses both 
NahR and N genes, was almost as high as that obtained with 
pMP0579. Taken together, these results indicate that transcription 
from psal in pMPO580 is terminated at the thnL terminator before 
reaching the gfp gene, but that transcription can continue onto the 
gfp gene if production of the anti-termination protein is also induced 
by salicylate. 

In the MP0553 strain, which constitutively produces low levels of 
T7 RNA-polymerase, expression levels were very low in the absence 
of arabinose (Fig. 4). However, GFP expression dramatically 
increased to even higher levels than those obtained from the psal 
promoter when the plasmid copy number was increased by addition 
of arabinose. This clearly shows that the T7 promoter in pMP0579 is 
functional and that the specialized MP0553 strain produces T7 
RNA-polymerase. The gfp expression level from the plasmid pMPO- 
580, which contains the thnL terminator, was higher than that from 
pMP0579, indicating that this terminator does not terminate tran- 
scription by the T7 RNA-polymerase. 

A reconstruction of the SIGEX technology 24 was performed by 
mixing cultures of MP0554 Rif R and MP0554 Nal R carrying the 
plasmid pMP0579. Only the MPQ554 Nal R /pMPQ579 culture was 
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Figure 3 | Expression of the gfp gene from pMP0579, or its derivative 
pMPO580 bearing the thnL terminator, in the strains MP0554 that 
produces the activator NahR and the antitermination protein N, and 
MP0555 that is similar to MP0554 but bears a frameshift in the gene N. 
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Figure 4 | Expression of the gfp gene from pMP0579 or its derivative 
bearing the thnL terminator pMPO580, in the strain MP0553 that 
constitutively produces low levels of T7 RNA-polymerase. 

pre-induced with salicylate to transcribe gfp from the standard psal 
catabolic promoter. Cultures were mixed in the 1:1,1:10 and 10:1 
ratios of Nal R :Rif R , purified by FACS and plated for subsequent 
resistance analysis to establish the ratios of induced vs. non-induced 
bacteria recovered after sorting. At the 1 : 1 Nal R :Rif R ratio, all the 
bacteria recovered after sorting were Nal R (induced); at the 1 : 10 
Nal R :Rif R ratio, the proportion of induced bacteria increased from 
0.01% to approximately 30% after sorting; finally, at the 1 : 10 
Nal R :Rif R ratio this proportion increased from 0.001% to approxi- 
mately 15% after sorting. Thus, FACS enriched the induced strain 
10 4 -fold. 

Construction of a metagenomic library and identification of clones 
conferring carbenicillin resistance. The plasmid pMP0579 was 
used to construct a metagenomic library from a shore at Punta 
San Garcia, Cadiz, Spain contaminated with crude oil from a 
tanker spill. The library comprises approximately 2 Gigabases distri- 
buted across approximately 54,000 different clones, and is main- 
tained in the strain EPI300-T1. 

Resistance to (3-lactams are predicted to be abundant in soils even 
in the absence of anthropogenic selection pressure 2930 . Because of 
this and because it is easily selectable, resistance to carbenicillin was 
chosen as the activity of interest to be identified in the transferred 
metagenomic clones. 

Direct selection of the EPI300-T1 culture hosting the library for 
carbenicillin-resistant clones resulted in the isolation of just a few 
tiny colonies after 2 days of incubation with a frequency much lower 
than 1/54,000. This indicated that none of the fosmids was efficiently 
conferring resistance to CblOO to EPI300. 

Triparental matings were performed using the EPI300-T1 hosting 
the bulk metagenomic library as the donor culture, each of the nali- 
dixic acid derivatives from EPI300-T1, MP0553 and MP0554 as 
recipient strains, and DH5tx/pRK2013 as the helper strain. We 
observed a very high conjugation frequency of over 6%, regardless 
the recipient strain. This indicates that the original pMP0579 vector 
and the metagenomic clones can be transferred to other strains with 
similar efficiency, despite the approximately 40 Kb of metagenomic 
DNA that each clone is expected to carry. 

An approximately 6-fold increase in transconjugants resistant to 
100 mg L _1 carbenicillin (Cb R ) were obtained with the specialized 
strains (6X 10~ 5 positives/transconjugants) versus the conventional 
EPI300-T1 NalR strain. In the case of the MP0554 strain, this 
increase was dependent on the addition of salicylate, which induces 
transcription activation from psal by NahR and anti- termination by 
the N protein (not shown). 

A total of 6 different fosmids were isolated from the 100 transcon- 
jugants derived from NalR derivatives of EPI300-T1, MP0553 or 
MP0554. All EPI300-T1 Cb R transconjugants contained the same 
fosmid, whose restriction pattern matched that previously identified 
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Table 1 | Mating transfer of the carbenicillin resistant fosmids. Values represent the frequency of carbenicillin resistant transconjugants 
relative to those transconjugants that had received the fosmid 
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in the clone directly selected from the library. This fosmid, desig- 
nated ETN1, was also found after the direct selection of carbenicillin 
resistant clones from the culture hosting the original library, and 
among the transconjugants derived from the specialized strains. 
The use of the strain producing T7 RNA polymerase allowed iden- 
tification of 3 additional fosmids, TN2, TN3 and TN4. These fosmids 
were also identified among the transconjugants derived from 
MP0554, the specialized strain bearing the N anti-termination sys- 
tem. This strain allowed identification of 2 additional fosmids, N5 
and N6, which were not detected using the other strains. 

To confirm the capacity of each fosmid to confer Cb R to each of the 
recipient strains, the six fosmids were transferred back to the EPI300- 
Tl Rif R derivative and the resulting strains were used as donors in 
triparental matings together with the Nal R derivatives as recipient 
strains and the DH5a/pRK2013 helper strain. As shown in Table 1, 
only the fosmid ETN1 conferred Cb R to EPI300-T1 Rif R . In addition, 
the carbenicillin resistance provided by this fosmid was apparently 
limited since only one tenth of the transconjugants exhibited suf- 
ficient resistance to generate a colony on LB-agar containing carbe- 
nicillin. The fosmids ETN1, TN2, TN3 and TN4 provided Cb R to the 
strain MP0553 that produces T7 RNA-polymerase, and all 6 fosmids 
conferred Cb R to the strain MP0554. These results using the isolated 
fosmids are in full agreement with those obtained using the bulk 
metagenomic library and confirm that at least six fosmids of the 
library potentially encode carbenicillin resistance genes. However, 
only a small fraction (one out of six) of the fosmids actually expressed 



the resistance gene by itself and conferred Cb R in the EPI300-T1 
strain. 

Gene identification in fosmids providing carbenicillin resistance. 

Sequence analysis of each fosmid revealed a number of genes that 
may confer resistance to carbenicillin, whose locations and 
orientations are shown in Figure 5. 

ETN1 encodes a class-C PTactamase that is oriented opposite to 
the heterologous promoters. This PTactamase is most similar to 
homologues found in different Pseudomonas species. Immediately 
downstream of the blaC gene is a 103 amino acid open reading frame 
containing a 4 nucleotide overlap with the blaC 3 'end. The overlap 
may be indicative of translational coupling, suggesting that the short 
Orf could be related to the PTactamase function. The blaC gene 
either with or without the short orf was subcloned into pMP0579 
in the same orientation as the heterologous promoters. Although 
more tolerant than the strain without the plasmid, most of the trans- 
conjugants receiving any of these subclones were unable to produce a 
colony in the presence of Cb 10 o, which indicated that the blaC alone 
cannot account for all the carbenicillin resistance provided by the 
original fosmid. 

ETN1 also codes for the 3 components of an efflux pump of the 
resistance-nodulation-cell division (RND) permease family. The fos- 
mid also contains a fourth component encoding an additional outer 
membrane protein. Since efflux pumps have been shown to contrib- 
ute to the tolerance of bacteria to different toxic compounds such as 
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Figure 5 | Schematic diagram showing the clones that confer carbenicillin resistance. All putative orfs and their orientations are shown although only 
those putatively involved in carbenicillin resistance are highlighted. Genes are not drawn to scale. 
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aromatic compounds or antibiotics 31 , this efflux pump could also 
contribute to the carbenicillin resistance. 

In the TN2 fosmid a class-B P-lactamase gene was found 6 Kb 
downstream of the heterologous promoters in the same orientation 
and a blaR gene encoding a LysR-type transcriptional activator just 
upstream in the opposite direction. The remaining sequence codes 
for hypothetical proteins or proteins with unrelated functions. In 
spite of blaR, this blaB gene could not be autonomously expressed 
in E. coli, though it was transcribed from any of the heterologous 
promoters present in the vector. 

TN3 and TN4 have largely common inserts. In fact, the TN4 insert 
is included within the insert of TN3. Neither of these fosmids contain 
Orfs putatively coding for a P-lactamase enzyme. The only genes 
potentially involved in Cb resistance in these fosmids code for the 
components of an RND-type efflux pump that are located in the 
same orientation 16 Kb or 13 Kb from the heterologous promoters, 
depending on the fosmid. A DNA fragment encoding just the 3 
components of the efflux pump was subcloned from TN4 into the 
pMP0579 vector in the appropriate orientation and the resulting 
plasmid, pMP0586, transferred to MP0554. The subclone was able 
to provide resistance to Cb 10 o although only when salicylate is pre- 
sent, just as with the original fosmid. This indicates that this efflux 
pump is able to provide carbenicillin resistance by itself if adequately 
expressed from a heterologous promoter in E. coli. 

N5 and N6 also share most of their insert DNA, which is cloned in 
opposite orientations in each fosmid. In N6, a class-B P-lactamase 
gene was found 1 1 Kb downstream from the heterologous promoters 
and transcribed in the same orientation. Presumably, autonomous 
expression of this gene is not efficient enough, and requires hetero- 
logous transcription from the psal promoter to provide full resistance 
to Cb 100 in E. coli. Intriguingly, in N5 this gene is located in the 
opposite orientation but N5 also requires salicylate induction to 
confer carbenicillin resistance. A possible explanation is that hetero- 
logous transcription in N5 results in expression of a gene coding for 
an activator of the blaB gene. A candidate activator gene, located 
27 Kb away from the psal promoter is shown in Fig. 5. 
Alternatively, some additional open reading frames in the same ori- 
entation as psal could be responsible for carbenicillin resistance in 
N5. There are 5 other orfs in the same orientation as psal but none of 
the predicted amino acid sequences provide any clue regarding their 
potential function. 

Discussion 

Functional metagenomic technology is a powerful tool that has been 
applied to the discovery of novel natural products and enzymes of 
biotechnological interest 32 . The potential of functional metage- 
nomics has been hampered by inefficient metagenomic gene expres- 
sion in heterologous hosts. Although gene expression is a multi-step 
process involving transcription, translation and post-translational 
modifications, transcription is the most frequently regulated process 
and is probably the main limiting step of metagenomic gene express- 
ion in surrogate hosts. Because of that, the aim of this work was to 
improve metagenomic gene transcription. Here we have shown that 
heterologous gene transcription is an effective way of increasing the 
efficiency of metagenomic gene expression provided that transcrip- 
tion can proceed along the metagenomic DNA. To prevent transcrip- 
tion termination at metagenomic termination signals, we have taken 
advantage of two phage expression systems that are much less sens- 
itive to the bacterial transcription termination signals. 

The N anti- termination system is inducible allowing metagenomic 
gene expression to be regulated. In addition, absolute expression 
levels can be modulated since salicylate-inducible system drives dif- 
ferent expression levels in proportion to the salicylate concentra- 
tion 33 . This feature may be of particular relevance when expressing 
functions that might be deleterious to the surrogate host 34 . 



On the other hand, the T7 expression system constructed in the 
strain MP0553 is constitutive, although polymerase expression 
levels are kept low by the nasF attenuator inserted between the 
lacUVSp and the gene-1. Expression levels from the T7 promoter 
are still 3-fold higher than those obtained from the psal promoter. 
However, in the case that overexpression of the screened function has 
a deleterious effect on bacterial growth, expression can be reduced by 
not increasing the plasmid copy number (Fig. 4). 

The metagenomic library constructed with DNA from a contami- 
nated coastal soil resulted in 54,000 independent clones. Although, 
metagenomic libraries with higher number of clones have been 
reported using the original vector, the number of clones depends 
on the quality of the DNA samples and, although we did not test 
whether during pCCl-FOS modifications we had inadvertently 
altered the cos site of the vector, we did not alter this sequence and, 
a priori, there is no reason to think that the modified vector had lower 
packaging efficiency than the the original vector. Testing heterolog- 
ous gene expression involved transferring the metagenomic library 
by mating to the specialized strains. This way, a total of 6 clones 
conferring carbenicillin resistance were isolated from a metagenomic 
library containing 2 Gb of metagenomic DNA. Although this ratio of 
positive clones may look low, it is comparable to those obtained with 
other metagenomic libraries 5,30 . In addition, the diversity and abund- 
ance of antibiotic resistance determinants can also be very different 
depending on the DNA source. It is important to bear in mind that 
our metagenomic DNA comes from a soil contaminated by an oil 
spill, and therefore the biodiversity and relative abundance of anti- 
biotic resistance genes may be lower than that of a non-contaminated 
soil. Theoretically, this additional mating step might impose some 
bias against fosmids that in some way reduce the growth rate of the 
bacterial host, thereby reducing the recovered diversity after the 
mating. However, at least in the case of carbenicillin resistance deter- 
minants, this additional step did not prevent the isolation of the 
fosmid already identified by direct selection using the original lib- 
rary, but rather increased recovered diversity since it allowed the 
identification of 5 additional fosmids. In any case, in order to prevent 
this potential bias, it is possible to directly construct the metage- 
nomic library using a specialized strain such as MP0554 as the 
surrogate host. 

Although the N anti-termination system allowed identification of 
some fosmids that the T7 expression system could not, both systems 
allowed metagenomic gene transcription from promoters located at 
least 16 Kb upstream, and through a number of genes in the opposite 
orientation to the promoter (see fosmid TN3). Although other fac- 
tors may influence gene expression levels and may preclude iden- 
tification of other positive clones present in the metagenomic library, 
it is clear that transcription is a major limiting factor for the func- 
tional analyses of metagenomic libraries, which can be circumvented 
by the phage-based heterologous expression systems. 

The vector also includes a promoterless gfp downstream of the 
metagenomic cloning site to exploit the SIGEX technology, which 
has considerable potential for the identification of inducible genes 
and regulatory systems. In spite of the potential problems that may 
prevent the identification of a regulatory system by using this type of 
screen 35 , there are ways of avoiding them 36 . Our reconstruction of 
SIGEX technology indicates that a bacterial subpopulation expres- 
sing genes from a conventional catabolic promoter such as psal can 
be detected and enriched 10 4 -fold by FACS. This enrichment is suffi- 
ciently high to detect positive clones that were present in a metage- 
nomic library with a frequency of 10~ 6 or even lower, therefore, 
enough to detect any positive clone that is present in a metagenomic 
library. 

Besides the relatively low permeability of the outer membrane of 
gram negative bacteria, there are 3 main mechanisms for conferring 
tolerance to pTactam antibiotics 37 . Among the resistant clones we 
have found genes coding for pTactamases and for efflux pumps 
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Table 2 | Bacterial strains and plasmids used in this work 



Strain Genotype/Phenotype Reference/Source 



E. coli EPI300-T1 R F" mcrA A(mrr-hsdRMS-mcrBC) (Str R ) 4>80d/acZAMl 5 AlacX74 recAl enc/Al araDl 3 9 A{ara, Epicentre 

/eu)7697 galU ga/KAr rpsL nupG trfA tonA dhfr 

E. coli MP0553 EPI300-T 1 R Atrg::Pi aUVS -nasF a\\enua\or-gene] This work 

E. coli MP0554 EPI300-T1 R A/rg:: naf)R/P so/ -geneN This work 

E. coli MP0555 EPI300-T1 R Atrg:: nahR/V sar - truncated gene N This work 

E. coli DH5a (p80d/acZAMl 5 A{lacZYA-argF)U 1 69 recAl endAl hsdRl 7 (nf m k + ) supEAA thi-1 gyrA re/Al 43 

Plasmid Genotype/Phenotype Reference/Source 

pCCl FOS-Ceul Cm R , pCCl FOS (Epicentre) with two Ceul sites flanking the cloning site Eco72l M. Ferrer 

pMP027 Ap R , expression vector with rrnBTl T2-Pm-nasF attenuator: :MCS, ColEl replication origin 26 

pCNB4-S2 Ap R Km R pUT/mini-Tn5 nahR/Psal-^sylS2 ' 44 

pMP0556 Ap R Piacuvs cloned into pBluescript II SK+ This work 

pMP0557 Ap R , nasF attenuator cloned into pMP0556 This work 

pMP0558 Ap R Km R , kanamycin resistance gene from pKD4 cloned into pMP0557 This work 

pMP0559 Ap R Km R Cm R , chloramphenicol resistance gene from pKD3 cloned into pGPl-2 This work 

pMP0561 Cm R , pCCl FOS-Ceul derived fosmid with or/T This work 

pMP0563 Ap R , nahR-P na h-Psal cloned into pBluescript II SK+ This work 

pMP0564 Ap R Cm R , chloramphenicol resistance gene from pKD3 cloned into pMP0563 This work 

pMP0565 Ap R Cm R , gene N cloned into pMP0564 This work 

pMP0571 Cm R , pMP0561 derived fosmid with P sa /-nuft cloned downstream T7 promoter and the cloning site Eco72l This work 

pMP0575 Ap R Cm R , pMP0565 derived plasmid with a frameshift in gene N This work 

pMP0579 Cm R , pMP0571 derived fosmid with a promoterless gfp cloned downstream the cloning site Eco72l This work 

pMPO580 Cm R , pMP0579 derived fosmid with the thnL transcription terminator from Sphingomonas This work 

macrogolitabida strain TFA cloned in the Eco72l site 

pMP0586 Cm R Cb R , efflux pump from TN4 subcloned in pMP0579 This work 

pMP0634 Km R , plasmid containing gfp gene 45 

RP4 Km R Ap R (Tnl)Tc R lncP-l Tra + Cma" 46 

pNK736 Ap R , plasmid containing Pi ac uvs 47 

pBluescript II SK+ Ap R , phagemid vector Stratagene 

pBluescript II KS+ Ap R , phagemid vector Stratagene 

pBBRlMCS-3 Tc R , mobilisable plasmid 48 

pRK2013 Km R , helper in triparental matings 19 

pGPl-2 Km R , plasmid containing gene-1 (T7 RNA polymerase) 49 

pKD3 Ap R Cm R OriRy ' 50 

pKD4 Ap R Km R OriRy 50 

pKD46 Ap R , oriR 1 0 1 repA 1 0 1 (ts) araBp-gam-bet-exo 50 

pCP20 Ap R Cm R Ts (30°C) " 51 



(Fig. 5) but not penicillin binding proteins (insensitive transpepti- 
dases). Of the efflux pumps, those belonging to the RND (Resistance- 
Nodulation-Division) are the only ones that exist in a tripartite form 
traversing both the outer and inner membrane and can pump mole- 
cules from the cytoplasm, from the periplasm or even when assoc- 
iated to the inner membrane 31 . Although many RND pumps can 
accommodate |3-lactams, just a few have been definitely linked to 
(3-lactam resistance 38 . They are usually associated to other forms of 
resistance such as a )3-lactamase although beta-lactamase-negative, 
ampicillin-resistant (BLNAR) Haemophilus influenza isolates have 
been described that can tolerate up to 16 mg L _1 of ampicillin 3 '. In 
these cases, overexpression of the AcrAB efflux pump was associated 
with mutations in PBP3, one of the 5 penicillin binding proteins of H. 
influenzae, that lowered affinities of beta-lactams to PBP3. Previous 
metagenomic analyses of (3-lactam resistance resulted in large num- 
bers of clones bearing bla genes but no efflux pumps have previously 
been linked to bla genes or to beta-lactam resistance 40 . 

The way earlier metagenomic libraries were constructed may have 
prevented the identification of other resistance determinants in the 
past given that we have shown that some metagenomic RND pumps 
are not only contributing to carbenicillin resistance but are able to 
confer E. coli resistance to 100 mg L~' of carbenicillin by themselves 
(fosmids TN3 and TN4). 

METHODS 

Bacterial strains, growth conditions, plasmids and oligonucleotides. The bacterial 
strains and plasmids used in this study are described in Table 2. Luria-Bertani (LB) 
medium or LB-agar was used as the standard growth medium, and bacteria were 



grown at 37°C except for the screening and selection of fosmids conferring 
carbenicillin resistance that were carried out at 30 C C. When needed, antibiotics were 
used at the following concentrations: 12.5 ug mL"' chloramphenicol, 100 ug mL" 1 
carbenicillin, 15 ug mL" 1 nalidixic acid, 20 (ig mL" 1 rifampicin, 25 (ig mL" 1 
kanamycin and 100 p.g mL" ! ampicillin. The copy number of the fosmids in 
EPI300 1M -Tl and derivative strains was increased by the addition of 1 mM arabinose. 
5 mM salicylate was used as an inducer for transcription level analyses and 1 mM 
salicylate for the screens. 

DNA manipulations were performed according to standard procedures 41 . Details 
on the construction of fosmids and strains in this work are described in 
Supplementary Methods. 

Oligonucleotides are described in Supplementary Table Si. 

Conjugative matings. Vectors and metagenomic libraries were transferred by 
triparental matings' 9 with DH5a/pRK2013 as the helper strain. Conjugative matings 
were performed on LB-agar without antibiotic selection overnight at 30 C. The 
mating mixtures were then plated on LB-agar with the necessary antibiotics for the 
transconjugants selection, carbenicillin for the screening and arabinose for increasing 
the copy number of the fosmid, and incubated at 30°C for 48 h. 

Mating frequencies were estimated as the ratio of transconjugant clones of the 
recipient strain (chloramphenicol+rifampicin resistant or chloramphenicol + nali- 
dixic acid resistant clones) to total clones of the recipient strain (either rifampicin or 
nalidixic acid resistant, depending on the recipient strain). 

Functionality tests of the vectors and specialized strains. The transcription levels 
from the heterologous promoters present in the modified vectors were analysed by 
detecting the fluorescence levels from the promoterless gfp gene, which is used as a 
reporter gene. Analyses were performed in the multifunctional microplate reader 
POLARstar Omega (BMG LABTECH GmbH, Germany), with a microplate 
COSTAL 96 and the following parameters: excitation filter 485BP1, emission filter 
EM520 and gain 1000. 

Flow cytometry and sorting. Mixed bacterial cell suspensions were analyzed by flow 
cytometry using a FACScalibur cytometer equipped with an argon laser emitting at 
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488 nm. Bacteria were detected by side scatter (488/10 nm band pass filter) and the 
fluorescence intensity was detected using the FL1 channel {530/30 nm band pass 
filter). The Cellquest Pro software {Becton- Dickinson) was used for data acquisition 
and analysis. For cell sorting, flow cytometer water lines were pre-treated with a 
bleach solution (FACSClean) and autoclaved phosphate saline buffer (PBS) was used 
as sheath fluid. The exclusion mode was used to sort GFP- expressing cells. Sorting 
gates were sufficiently separated to avoid false positives/negatives. 

Construction of a metagenomic library. Bacteria were extracted by direct addition 
of 400 mL of disruption buffer (0.2 M NaCl, 50 mM Tris-HCl pH 8.0) to 160 g of the 
soil sample and mixing overnight with shaking. The suspension was centrifuged at 
low speed (400 g for 3 minutes) and the supernatant poured over 10 mL of a 
Nycodenz resin solution {1.3 g mL -1 ). Centrifugation in a gradient of the Nycodenz 
resin (Axis-Shield) was carried out at 10,000 g X for 40 minutes at 4 C C. The bacteria- 
containing a band at the interface between the Nycodenz and the aqueous layer were 
recovered and mixed with PBS. The cells were pelleted by centrifugation at 10,000 g 
for 20 minutes, and re-suspended in TE buffer (10 mM Tris, 1 mM EDTA, pH 8.0). 
DNA was then extracted with the G NOME DNA kit {MP Biomedicals). Using this 
method, we obtained 24 ug of DNA from 160 g of soil, with an average size of 
approximately 40 kb. 

The metagenomic library was constructed according to the CopyControl rM Fosmid 
Library Production kit protocol (Epicentre), except that pMP0579 was used instead 
of pCClFOS. pMP0579 was linearised by restriction with the Pmll enzyme {New 
England Biolabs, isoschizomer of Eco72V), dephosphorylated with Shrimp Alkaline 
Phosphatase {USB) and concentrated to 300 mg L~ ! with a Centrifugal Filter Device 
(Microcon, Millipore). 

Gene identification in fosmids conferring carbenicillin resistance. Each fosmid 
was sequenced with a Roche 454 GS FLX Ti sequencer (454 Life Sciences, Branford, 
CT, USA) at Lifesequencing S.L. (Valencia, Spain). Sequences were assembled using 
the Newbler GS De Novo Assembler v.2.3 (Roche). The assembled sequences were 
compared to those in the databases using the Blastx and Blastn programs 42 . Those orfs 
potentially involved in carbenicillin resistance are highlighted in Figure 5. 
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